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Amendments to the Claims 

This listing of claims will replace all prior versions of claims in the application: 
Listing of Claims: 

1 . (Currently Amended) A computer implemented normalization system, comprising the 
following computer executable components : 

an interface component that receives data corresponding to a heterogeneous knowledge 
base, and 

a normalization component that applies a utility model that predicts accuracy or quality 
of results to provide a regularized understanding of the knowledge base ; and 

a preference component that enables users to assess or select various parameters that 
influence the utility model the preference component processes at least one of a user setting for 
a cost, a value, and a language preference, the preference component includes a model where a 
user assesses a parameter v, indicating a dollar value of receiving a correct answer to a question, 
and where a parameter c represents a cost of each query rewrite submitted to a search engine . 

2. (Original) The system of claim 1, the interface component processes questions posed by 
users. 

3. (Original) The system of claim 1, the utility model dynamically controls extraction of 
previously unknown or disassociated information from the knowledge base. 

4. (Original) The system of claim 1, the utility model controls a number of queries 
submitted to the knowledge base given decision-theoretic considerations. 

5. (Currently Amended) The system of claim 1, the knowledge base includes at least one of 
a local database, a file, a directory, an electronic encyclopedia, a dictionary, a remote database, 
and a remote web site. 
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6. (Original) The system of claim 1, where the knowledge base is the World Wide Web. 

7. (Original) The system of claim 1 , the utility model applies a cost-benefit analysis to 
dynamically control the number and types of attempts made to acquire information or answers 
from the knowledge base in response to a question or questions. 

8. (Original) The system of claim 7 5 the utility model includes an analysis of the costs of 
searching for information versus the benefits or value of obtaining more accurate answers to 
questions. 

9. (Original) The system of claim 1, the interface component initiates a dialog with users 
based upon predetermined probability thresholds or other criteria that includes a cost-benefit 
analyses that considers when it would be best to ask a user to reformulate a question rather than 
expending effort on processing a query that may be expensive in terms of searching for 
information from the knowledge base or likely to yield inaccurate results. 

10. (Original) The system of claim 9, the dialog is initiated from an assessment of a cost of 
delay and effort associated with a query reformulation and a likelihood that a reformulation 
would lead to an improved result. 

11. (Cancelled) 

12. (Cancelled) 

13. (Cancelled) 

14. (Currently Amended) The system of claim 1 [[3]] 5 further comprising a value of receiving 
an answer expressed as a function of details of a current context, the value of the answer is 
linked to at least one of a type of question, an informational goal, and a time of day for a user. 
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15. (Currently Amended) The system of claim 1 [[3]], further comprising determining a cost 
of submitting queries as a function of at least one of a current load sensed on a search engine or 
the numbers of queries being submitted by a user's entire organization to a third-party search 
service. 

1 6. (Original) The system of claim 15, further comprising determining the costs non-linearly 
with increasing numbers of queries. 

17. (Original) The system of claim 16 ? further comprising assessing costs of n at a first level 
of cost by a search service supporting a question-answering system at an enterprise, after which 
expenses are incurred in a supra-linear manner, n being an integer. 

18. (Original) The system of claim 1, the utility model determines an expected value of 
submitting n queries as a product of a likelihood of an answer, given evidence E about a query, 
p(A\E,n 9 §, and a value of obtaining a correct answer v, p(A\E,n,& v. 

19. (Original) The system of claim 1 8, the value of an answer, v, is assessed in terms of the 
cost of queries, c as a multiple k of the cost of each query c, v=kc. 

20. (Original) The system of claim 19, further comprising a cost model that grows linearly 
with a number of queries, nc. 

21 . (Original) The system of claim 20, further comprising a utility model that determines an 
ideal number of queries to submit, optimizes a net expected value, computed as a difference of 
the expected value and cost, for different w. 

| 

22. (Original) The system of claim 21, the ideal number of queries is expressed as: 
n*= arg mdx n p(A\E,n 9 £)kc - nc. 

23. (Original) A computer readable medium having computer readable instructions stored 
thereon for implementing the interface component and the normalization component of claim 1 . 

I 
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24. (Currently Amended) A computer implemented method to normalize a database, 
comprising the following computer executable acts : 

automatically forming a set of queries from a question posed by a user; and 
performing a cost-benefit analysis on the set of queries to generate a query subsetjhe 
cost benefit analysis enables users to assess or select various parameters that influence a utility 
model the utility model processes a user setting for a cost, the utility model allows a user to 
assess a parameter v. indicating a dollar value of receiving a correct answer to a q uestion, and 
where a parameter c represents a cost of each query rewrite submitted to a search engine . 

25. (Original) The method of claim 24, further comprising automatically ranking the set of 
queries in an order of likelihood of providing a suitable answer. 

26. (Currently Amended) The method of claim 24, further comprising automatically training 
at least one model to generate for the query subset. 

27. (Original) The method of claim 24, further comprising submitting the query subset to at 
least one search engine. 

28. (Original) The method of claim 27, further comprising receiving results from the at least 
one search engine and automatically composing an answer. 

29. (Currently Amended) A computer implemented system to facilitate database 
normalization, comprising: 

computer implemented means for formulating a query set from a user question; and 
computer implemented means for forming a query subset from the query set based at 
least in part on a utility model employed for normalizing the database ; and 

computer implemented means for enabling users to assess or select various parameters 
that influence the utility model the utility model processes a user setting for a cost, the utility 
model allows a user to assess a parameter v, indicating a dollar value of receiving a correct 
answer to a question, and where a parameter c represents a cost of each query rewrite submitted 
to a search engine . 
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30. (Currently Amended) A computer implemented question-answering system, comprising 
the following computer executable components : 

a rewriting component that receives a user query and automatically formulates a set of 
queries; and 

a cost-benefit component to reduce the set of queries based upon an analysis of expected 
gains in accuracy of an answer in view of associated costs for additional queries , the cost-benefit 
component enables users to assess or select various parameters that influence a utility model the 
utility model processes a user setting for a cost, the utility model allows a user to assess a 
parameter v, indicating a dollar value of receiving a correct answer to a question, and where a 
parameter c represents a cost of each query rewrite submitted to a search engine . 

3 1 . (Original) The system of claim 30, further comprising a ranking component to determine 
an ordering for the set of queries. 

32. (Original) The system of claim 30, further comprising an answer composer to formulate 
an answer from the reduced set of queries. 

33. (Original) The system of claim 30, further comprising at least one search engine to 
process the set of queries. 

34. (Original) The system of claim 30, further comprising a component to process foil text 
and/or text summaries of articles returned by a search engine. 

35. (Original) The system of claim 30, further comprising a component that learns logical or 
statistical predictive models that predict an accuracy or quality of answers as a function of nature 
or number of queries issued to a knowledge base. 

36. (Original) The system of claim 30, further comprising a component that learns logical or 
statistical predictive models that predict an age appropriateness of answers as a function of 
nature or number of queries issued to a knowledge base. 
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37. (Original) The system of claim 35, the knowledge base includes at least one search 
engine. 

3 8 . (Original) The system of claim 3 5 , the models employ Bayesian learning procedures. 

39. (Original) The system of claim 30, further comprising a component to analyze at least 
one feature, the feature including at least one of a conjunctional and a phrasal rewrite. 

40. (Original) The system of claim 39, the feature includes at least one of a longest phrase, a 
word length, a number of capital letters, a number of phrases, a number of stop words, a number 
of words, a percentage of stop words, a number of primary parses, a number of secondary parses, 
and a measure of grammatical suitability. 

41 . (Original) The system of claim 39, further comprising higher-level features including a 
distribution of topics associated with results of queries. 

42. (Currently Amended) The system of claim 4 1 , the features are identified with a statistical 
classifier that assigns topics based on text being analyzed.[[,]] 

43 . (Original) The system of claim 3 9, further comprising tags that are derived from natural- 
language parses of initial questions, and text of results or text of snippets returned from the 
results. 

44. (Original) The system of claim 39, further comprising a component to derive higher-level 
informational goals of a user, as derived from assessing goals directly, or as inferred from an 
analysis of the user's initial question. 

45. (Original) The system of claim 39, further comprising classes of features including 
attributes and statistics of attributes of morphological or semantic aspects of an initial (1) 
question and/or (2) one or more query results. 
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46. (Original) The system of claim 45, the classes include at least one of words and phrases, 
parts of speech, structure of natural language parse, length, topics and distribution of topics, and 
inferred or assessed informational goals or intentions. 

47. (Currently Amended) A computer implemented normalization system, comprising the 
following computer implemented components : 

an interface component that receives data corresponding to a heterogeneous knowledge 
base^ and 

a normalization component that applies a model that predicts accuracy or quality of 
results in conjunction with a utility model to provide a regularized understanding of the value of 
performing different information extraction actions from the knowledge base ; and 

a preference component that enables users to assess or select various parameters that 
influence the utility model the preference component processes at least one of a user setting for 
a cost, a value, and a language preference, the preference component includes a model where a 
user assesses a parameter v. indicating a dollar value of receiving a correct answ er to a question. 
and where a parameter c represents a cost of each query rewrite submitted to a search engine . 

48. (Original) The system of claim 47, the interface component processes questions posed by 
users. 

49. (Original) The system of claim 47, further comprising a dialog component that makes a 
decision when to engage a user to request a reformulated question or additional information. 

50. (Original) The system of claim 49, the dialog component alerts the user about the cost of 
receiving a good answer, or recommends that a query be attempted elsewhere. 
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